The determinant bound for discrepancy is almost tight 
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1 Abstract 

In 1986 Lovasz, Spencer, and Vesztergombi proved a lower bound for the hereditary 
discrepancy of a set system T in terms of determinants of square submatrices of the 
incidence matrix of F . As shown by an example of Hoffman, this bound can differ from 
herdisc(.F) by a multiplicative factor of order almost log n, where n is the size of the ground 
set of J 7 . We prove that it never differs by more than 0((log ro) 3 / 2 ), assuming \F\ bounded 
by a polynomial in n. We also prove that if such an F is the union of t systems F\ , . . . , Ft , 
£\\ • each of hereditary discrepancy at most D, then herdisc(J-") < 0(v^(logn) 3 / 2 D). For t = 2, 

this almost answers a question of Sos. The proof is based on a recent algorithmic result 
of Bansal, which computes low-discrepancy colorings using semidefinite programming. 

o 

1 Introduction 

Let V = [n] := {1,2,..., n} be a vertex set and F = {F\, F^, ■ ■ ■ , F m } be a system of subsets 
of V. The discrepancy of T is disc(J-") := min x disc (J 7 , x)> where the minimum is over all 
colorings X-V ^ and disc(J",x) := max i= i i2 ,..., m |Ej g F 1 x(j)\- 



The hereditary discrepancy of J- is 



herdisc(J-") := maxdisc(J r | j) 



jcv 

Here F\ j denotes the restriction of the set system T to the ground set J, i.e., {F(l J : F G F}. 

Bounding the discrepancy or the hereditary discrepancy of a particular set system from 
below is usually challenging. One of the strongest known tools is a result known as the 
determinant lower bound. To formulate it we define, for a real matrix A, 

detlb(A) := max max | det B\^ k , 

where the maximum is over all k x k submatrices B of A. For a set system F, we put 
det lb (J 7 ) := detlb(A), where A is the incidence matrix of F. 

Theorem 1 (Lovasz, Spencer, and Vesztergombi [LSV86]) For every (finite) set sys- 
tem F we havE herdisc(J") > idetlb('F). 

* Partially supported by the ERC Advanced Grant No. 267165. 
The bound in LSV86] is stated without the | factor. This has two causes: first, their discrepancy is scaled 
by 5 compared to ours, and second, in their argument, at one step they seem to be multiplying by 2 where, in 
my opinion, one should divide by 2. 
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Lovasz et al. [LSV86] conjectured that the determinant lower bound is tight up to a 
constant factor, i.e., herdisc(J r ) = 0(detlb(J r )) for all T. This was refuted by an example 
of Hoffmann^ which shows that herdisc(J r )/detlb(J 7 ) can be of order (log n)/ (log log n). A 
construction of Palvolgyi [PallO] . also presented in Section [5] below, provides the slightly 
stronger lower bound of fi(logn) for the same quantity. 

Here we prove that herdisc(J r )/detlb(J r ) cannot be much larger than in these examples, 
at least if \T\ is bounded by a polynomial function of n. 

Theorem 2 For every set system J- 



Next, we consider the situation where a set system T as above is a union of set sys- 
tems T\, J~2, ■ ■ ■ , Tti and we are interested in bounding herdisc(J-") in terms of the hereditary 
discrepancies of the T{. 

For t = 2, this problem was raised by Sos (it is cited, e.g., in [LSV86J). She asked whether 
herdisc(J-"i U T2) can be estimated in terms of herdisc(J-i) and herdisc(J r 2) for any two set 
systems T\ and J~2 (on the same vertex set). Hoffman's example mentioned above shows that 
herdisc(J r i U F2) cannot be bounded by a function of herdisc(J-"i) and herdisc(J r 2) alone. 

The next theorem shows that a good bound is possible if we also allow for a moderate 
dependence on m and n. Namely, herdisc(J 7 i LU2) can exceed max(herdisc(J r i), herdisc(J r 2)_) 
at most by a factor polylogarithmic in n and m, not much more than in Hoffman's example|f| 

The only previous result in this direction, from [KMV05], shows that if T2 consists of a 
single set, then herdisc(J r i U T2) = O (her disc (J^i) log n). 

Theorem 3 Let J 7 be a system of m sets on n vertices, and let J- = T\ U T2 U • • • U Tt ■ Then 



There exist systems J- of n sets on n points with discrepancy of order ^/n (e.g., systems 
derived from Hadamard matrices or random set systems; see, e.g., [MatlOj ). If we let J 7 , be 
the set system consisting of the ith set of such ann,i= 1, 2, . . . , n, then herdisc(J-~j) = 1, while 
herdisc(J-") = Q(y/n). In this sense, the bound in Theorem [3] is tight up to a polylogarithmic 
factor, including the dependence on t. 

Theorem [3] is an immediate consequence of Theorems [T] and [2] and of the next linear- 
algebraic lemma. 

Lemma 4 Let Ai, . . . , At be real matrices, each with n columns, let D := maxj = i^ r ..,( detlb(-Aj), 
and let A be a matrix in which each row is a row of some of the Ai . Then 



2 The vertex set in the example is the set of edges of the complete fc-ary tree T of depth k (so n ~ k k ). The 
set system T is a union T\ U Ti, where T\ consists of the edge sets of all root-to- leaf paths, and Ti contains, 
for each non-leaf vertex v, the set of the k edges connecting v to its successors. We have herdisc(J-i) < 1, 
herdisc(J" 2 ) < 1, detlb(.F) = O(l), and disc(.F) = k w (log n)/ (log log n). See, e.g., |BS95| or |MatlO| for more 
details. 

3 A connection of Sos's question to the tightness of the determinant lower bound was observed in |LSV86) . 
although with a different proof, which would yield a quantitatively weaker result in our setting. 



herdisc(J') < detlb(J r ) • 0(log(mn)y / i og n 




herdisc(J 7 ) < ( max herdisc(J 7 j)) • 0( Vt log(mra)i/log n 



detlb(A) < Dy/et. 
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In case where some of the detlb(j4j) are much smaller than the others, for example, one 
can obtain a somewhat better upper bound by making finer estimates in the calculation in the 
proof. However, a general formulation of such a finer bound looks cumbersome, and it seems 
that in such similar improvement can usually be achieved by applying the lemma 

repeatedly in several stages with various values of D. 

The bound in Lemma[His generally tight up to a constant factor; this can be seen from an 
example similar to the one mentioned below Theorem [3l Namely, let A be an n x n Hadamard 
matrix (i.e., a matrix with pairwise orthogonal rows and with ±1 entries, which is well known 
to exist for infinitely many values of n), and let A{ be the single-row matrix made of the ith 
row ofi,i = l,2,...,n. Then, obviously, detlb(Aj) = 1, while detlb(^) > det(A) 1 / n = ^fn. 
Moreover, if we partition the rows of this A into t blocks A\, . . . , At by n/t rows each, then 
the Hadamard bound, stating that the determinant of a matrix is at most the product of 
the Euclidean norms of the rows, implies detlb(^4j) < sjnjt. This shows the tightness of 
Lemma [3] for all t < n. 

Vector discrepancy and Bansal's algorithm. The proof of Theorem is based on 
a recent breakthrough — an algorithm of Bansal [BanlOj. The algorithm produces a low- 
discrepancy coloring of a given set system, using semidefinite programming and a clever 
randomized rounding strategy. To state the consequence of Bansal's work that we will use, 
we first introduce another notion of discrepancy. 

The vector discrepancy of the set system J 7 , denoted by vecdisc(J-"), is the smallest D > 
for which there exist unit vectors ui , . . . , u n G M. n such that 



< D, i = 1,2, ... , m, 

where || • || is the Euclidean norm. So, for vector discrepancy, one colors by unit vectors 
instead of dbl's. Since a ±1 coloring can also be regarded as a vector coloring by the vectors 
ei = (1, 0, . . . , 0) and — ej, we have vecdis^J 7 ) < dis^J 7 ). 

The hereditary vector discrepancy hervecdisc(J-*) is the maximum vector discrepancy of a 
restriction of J 7 to a subset J C V . 

Bansal's algorithm yields the following. 

Theorem 5 (Bansal [ BanlO] ) We have 

herdisc(J-") = 0(log(mn)) • hervecdisc(J-") 

for all J- . 



We conjecture that the claim of Theorem [5] actually holds with ^J\og(mn) instead of 
log(mn). If true, this would yield a similar improvement in Theorem [2] and get close to an 
asymptotically optimal bound, at least assuming m bounded by a polynomial in n. 



2 A dual formulation of the vector discrepancy 

Lemma 6 We have vecdisc(J-") > D if and only if there are nonnegative reals wi, ... ,w m 
with Ym^i w i — 1 an d re ^s z\,...,z n with Sj=i z j — D 2 such that for all x G M. n , 

m , \ 2 n 

i=i S'eFi ; j=i 
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Proof. We will use the duality of semidefinite programming. Dualizing a semidefinite 
program is a routine procedure, but unfortunately, I am not aware of an explicit recipe for 
the general case in the literature. Rather than converting the relevant semidefinite program 
to a standard form, it seems more convenient to use a duality theorem for conic programing 
from Duffin |Duf56| . which we now introduce. 

Let V, W be real vector spaces (for our purposes we may assume that they are finite- 
dimensional) , each with a scalar product, which we denote by (•, •) in both cases. Let K QV 
and L C W be closed convex conesQ let b G W and c G V be vectors, and let F: V — > W be 
a linear map. We consider the primal cone program (P) 

maximize (c, subject to b — F(£) G L, £ G K 

and the dual cone program (D) 

minimize (h,u) subject to F T (u>) — c G K* ,u> G L*. 

Here F T : W — > V denotes the adjoint of F; if we fix orthonormal bases in V and W, then 
the matrix of F T is the transpose of the matrix of F. The duality theorem asserts that if the 
maximum in (P) is a finite number 7 and if the set of feasible solutions of (P) has an interior 
point, then (D) is feasible as well and its minimum equals 7. (Since (P) and (D) are dual to 
one another, one can also interchange their role in the theorem.) 

In our case, we start with a vector program defining vecdisc(J r ), namely, 

minimize t subject to || YljeF t uj\\ 2 < t, i = 1, 2, . . . , m, ||ui || = ■ ■ • = ||u n || = 1, 

and we convert it to an equivalent semidefinite program in a standard way. We introduce a 
variable Q, which is an n x n matrix with q^j = ufuj (the Gram matrix of the Uj); as is 
well known, Q is of this form for some vectors ui, . . . , u n exactly if Q G PSD ra , where PSD n 
denotes the cone ofnxn positive semidefinite matrices. 

For the constraint || YljeFi u ill 2 — *j we ex P an d the left-hand side to ^ • k&F . uju^, and 
then it translates to 

t - (a<af ) • Q > 0, (2) 

where aj is the ith row of the incidence matrix A of J-, regarded as an n x 1 matrix (so a^a^ is 
aiiJixn matrix), and • denotes the scalar product of matrices (given by X • Y = J2i j x ijVij)- 
The constraint ||uj|| = 1 then reads 

Ej»Q = 1, (3) 

where Ej is the matrix with 1 at position and 0s elsewhere. 

This semidefinite program can be regarded as the cone program (P) with V = SYM n © R 
(where SYM n denotes the vector space of all symmetric n x n matrices with the • scalar 
product), W = R m © R n , K = PSD n © [0,oo), and L = [0,oo) m © {0 n } (where n denotes 
the n-component vector of 0s). We write the unknown £ G V in the form (Q, t); then we have 
F(Q,t) = (aiaf •Q — t,... ,a m a^ • Q - t,E\ • Q, . . . ,E n • Q). Finally, c = (0 nxn , -1) and 
b = (0 m ,l n ). 

4 A convex cone is a convex set K such that x G K implies Ax £ K for all A > 0. The dual cone of K is 
K* = {y : (x, y) > for all x 6 K}. A simple property we will often use is (K © L)* = K* © L* , where © 
denotes direct sum; we assume K C V, L C W , where V, W are disjoint vector spaces, and K ffi L C V © W . 
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As for the dual cone program, it is well known that PSD* = PSD n , so K* = K, and it 
is easily checked that L* = [0, oo) m © R n . Let us write the variable u in (D) in the form 
(w, -z), w G R m , z G R n . Then (D) becomes 

maximize J2]=i z j subject to YT=i w i a i a f ~ Yl]=i z j E j € PSD n, 

1 - XX 1 w 3 ^ °> > • • • > w m > 0. 

Let M stand for the matrix YllLi w i Si i Si J ~Yl 7 j=i z j E j- Then the condition of positive semidef- 
initeness of M means x T Mx > for all x G M n , and it is easy to verify that this can be 
rewritten as the inequality ([I]) in the lemma. 

It remains to verify that the duality theorem can be applied to these (P) and (D). We will 
check that (D) is bounded and has a feasible interior point. To verify boundedness, which 
means that Yuj=i z j cannot be arbitrarily large, we use ([1]) with x = l n : then the left-hand 
side is bounded, and so Y^=i z j ^ s bounded as well. 

For an interior feasible point, we can take, e.g., w\ = ■ ■ ■ = w m = Jj-, z\ = ■ ■ ■ = z n = —1. 
Then ([TJ obviously holds for all x. 

Thus, the duality theorem applies and shows that if vecdisc(J-~) > D, then the maximum 
in Q is at least D 2 . This yields the existence of the desired u>i and zj, and the lemma is 
proved. □ 

3 Proof of Theorem [2] 

We begin with a simple and probably standard lemma. 

Lemma 7 Let y G M. n be a vector. Then there exists a subset K C [n] of indices such that 
y is "almost constant" on K, in the sense that, for some t > 0, we have t < \yj\ < 2t for all 
j G K , and 



\y[K]\\>n 



ylogn, 

where y[K] denotes the \K\- component vector (yj : j G K). 

Proof. Let y max := max^ \yj\, and for i = 0, 1, 2, . . ., let K { := {j : \yj\ G (2 _i_1 y max , 2~ % y max ]}. 
The contribution to ||y|| of the components of y with indices in Ki for i > 21ogn, say, is neg- 
ligible, and so there exists some i$ for which YliGK 2/? = ^(l|y|| 2 / 1°§ n )- Then K := K,- L0 
will do. □ 
Theorem [2] will follow from Bansal's result (Theorem [5]) and the next lemma. 

Lemma 8 Let T = {F\, . . . ,F m } be a set system on [n] with vecdisc(J r ) = D. Then 
detlb(J 7 ) = VL{D/^gn~). 

Proof. We begin with the dual formulation of vector discrepancy from Lemma [6l For more 
convenient notation, we will write the nonnegative weight wi as [if. Moreover, we let J C [n] 
consist of the indices j with Zj > 0, and we will use the inequality ([1]) in Lemma [6] only for 
vectors x that are zero outside J. Writing Zj = 7? for j G J, we arrive at the inequality 

m * \ 2 n 
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for all x G R J , where \\(3\\ < 1 and [| > D. 

Next, using Lemma [7] for y = -y, we select K C. J with = Q(D/y/log n) and 

with 7 [if] almost constant (within a factor of 2). Setting fc := \K\ and £) := i H-y [^fC] || , 
the quadratic average of jj over j E K equals 2D/y/k, and so jj > D/Vk for all j G K. 
Therefore, restricting ([5]) to vectors x with Xj = for j G" K, we obtain 

- / \ 2 n 2 

£a 2 ( £ **) (6) 

i=l Vjg^nx y jGE" 

for all x G 

Let C := A[*,if] be the m x k incidence matrix of the system T\k (consisting of the 
columns of A whose indices belong to K), and let C be the m x k matrix obtained from C 
by multiplying the ith row by Then © can be rewritten as 

x r C T Cx.= IICxIl 2 > — llxll 

fe 

for all x G M k . 

This, by the usual variational characterization of eigenvalues, tells us that the smallest 
eigenvalue of the k x k matrix C T C satisfies X m i n (C T C) > D 2 /k. Then, since the determinant 
is the product of eigenvalues, we also have det(C T C) > (D 2 /k) k . The Binet-Cauchy formula 
then asserts that 

det(C T (7) = ^det((7[/,*]) 2 , (7) 

where the summation is over all /c-element subsets I C [m] and C[I, *] consists of the rows of 
C whose indices lie in I. 

We have det(C*[/,*]) = det(C[I, *]) ]J ieI fa. Setting M := max/ | det(C[I, *])|, we can 
rewrite the right-hand side of (J7|) and estimate it as follows: 

^det((7[/,*]) 2 = ^det(C[/,*]) 2 n^ 2 

< ^ 2 En^ 

i iei 

k\ ~ k\ ' 

where the penultimate inequality follows because every term Yiiei Pf occurs k\ times in the 
multinomial expansion of (/3 2 + • • • + /3^) fc . 

Letting B := C[I, *] for an / maximizing | det C[I, *]|, we thus have det(B) 2 > kl det(C T C) > 
k\(D 2 /k) k > (k/e) k {D 2 /k) k = n(D) 2k = ^1{D / ^/]agn) 2k . So the k x k matrix B witnesses 
detlb(J-") = Q,(D /y/\ogn), and the lemma is proved. □ 

Proof of Theorem [2J. By Theorem [SJ there is a subset J C [n] with vecdisc(J r | j) = 
f2(herdisc(.F)/log(mn)). Theorem [2] follows by applying Lemma [8] to F\ j . □ 
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4 Proof of Lemma |4] 



Let us consider akxk submatrix B of the matrix A, and let I\, . . . , It be index sets partitioning 
[k] such that B[l£, *] is a submatrix of A^, £ = l,2,...,t (we also admit = 0). Let ki := 

We now apply the Gram-Schmidt orthogonalization to the row vectors of each B[It,*], 
separately for each t. We obtain a new matrix B such that the rows of each B[l£,*] are 
orthogonal vectors. Let bj be the ith row of B. 

Since the orthogonalization proceeds by elementary row operations, which do not change 
the determinant, we have det B = det B. We estimate det B using the Hadamard bound 

k t 

| det B | = |detS| < J]jb;|| = JJ JJ ||b<||. (8) 

i=i i=\ i£i e 

Let us fix I for a moment. In order to estimate n^e^ l|b»ll> we consider the quantity 
det(B[l£, *]B[Ii, *] T ). On the one hand, since the bj are mutually orthogonal for i E fy, the 
matrix B[Ie, *]B[It, *] T is diagonal with the entries ||bi|| 2 , i 6 h, on the diagonal, and so 

det(B[I l ,*]B[I e ,*] T ) = Yl\\b i \\ 2 . (9) 

ieii 

On the other hand, by the Binet-Cauchy formula, we have 

det(B[I e ,*]B[h,*] T )= Y, det(B[h,J)) 2 = ^ deb(B[I t , J]) 2 , (10) 

JQk],\J\=k e JQk],\J\=k e 

the last equality holding again because B[Ii, J] is obtained from B[Ig, J] by elementary row 
operations which preserve the determinant. 

Putting (JHj), ([9]), and (fTUj) together, we arrive at 

det(B) 2 <f[ £ det(B[I e ,J]f < f[ (Mdetlb(^) 2 ^ < D 2k f[ 

e=i jc[k],\j\=k e i=i ^ l ' i=\ 

since each B[I#, J] is & ke x ki submatrix of A(. 

Then we estimate, using the concavity of the function x i— > xln(l/x) and Jensen's inequal- 
ity, 

= e Z^=i fcf ln(efc/fc f ) 
< e t(k/t)ln(et) _ r et sk 

Thus det(£) 1/fc < D^/et as claimed. □ 

Remark. The case t = 2 has a somewhat simpler proof using the Laplace expansion of 
det£>, which asserts that 

det B = ^ sgn(/, J) det (5 [J, J]) det (B [1,7]), 
J 




n : <- n f 
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where the sum is over all |J| -element subsets J C [k], I = [k]\I, and sgn(J, J) E {±1} is a sign 
depending on / and J in a way that is of no concern for us (see, e.g., [BJN83, Theorem 4.3]). 

In our case, we again let Ig be the set of indices of the rows of B that come from Ag, 
I = 1, 2, and kg = \Ig\. We use the Laplace expansion of det B for I = I\. For every J we have 
| det(B[Ji, J])\ < D kl and | det(B[I 2 ,J])\ < D k * by the assumption, and so det 5 < (^)D k < 
2 k D k . Hence det(B) 1 /* < 2D as needed. 

A bound on detlb(yl) for larger t can also be obtained by iterating this argument, but this 
method apparently leads only to detlb(A) = 0(tD). 

5 Palvolgyi's example 

As was pointed out by Palvolgyi (private communication, 2011), his geometric construction in 
[PallO] actually provides a slight quantitative improvement over Hoffman's example. Trans- 
lated to the setting of set systems, the construction yields, for every k > 1, two systems F\,F 2 
of A>element subsets of [n], with n = ( 2 ^) — 1 < 4 fc , such that 

(i) herdisc(J r i), herdisc(7"2) < 1, and 

(ii) under every two-coloring of [n], T\V]T 2 contains a monochromatic set (and consequently, 
disc(J"i U F 2 ) = k). 

Since the construction in [PallO] is presented geometrically, and property (i) is not entirely 
obvious, we provide a short self-contained exposition. 

The construction is inductive, and it requires two parameters, k and I. The inductive 
hypothesis is the following: 

For every k,£ > 1 and a ground set V of n = (t) — 1 elements, there exist 
set systems F\ = Fi(V,k,£), T 2 = J^iV, k,£) on V such that T\ consists of k- 
tuples, Ti consists of ^-tuples, herdisc(J-i), herdisc(J r 2) < 1, and for every red-blue 
coloring of V, there exists a set F £ T\ that is completely red or a set F G F 2 
that is completely blue. 

For k = 1 and I arbitrary, we have n = £, and we take T\ consisting of the I singleton 
subsets of V, while T 2 = {V}. The other base case I = 1 and k arbitrary is symmetric. 

In the inductive step, we take the ground set V and decompose it into three disjoint 
subsets: V with ( k ^^ 1 ) — 1 elements, V" with X ) — 1 elements, and a singleton set 

{p}. We inductively construct F[ := Fi(V',k — 1,£) and F' 2 := F 2 (V',k — 1,£), as well as 
T'l ■= Fi(V", k,£—l) &ndF 2 ' := F 2 (V", k,£- 1). Then we set T x := {FU{p} : F € F[}L>F", 
F 2 := T 2 U {F U {p} : F S F 2 }, and check the required properties by a straightforward 
induction. 

We begin with verifying that, under every red-blue coloring of V, T\ has a completely 
red set or F 2 has a completely blue set. If p is red, then we restrict the coloring on V and 
inductively find a red (k — l)-tuple in F[, which together with p gives a red A:-tuple in Fi, or 
a blue £-tuple in F 2 , which is also a blue ^-tuple in F 2 . The case of p blue is analogous. 

It remains to verify that her disc ( T\ ), her disc (J~ 2 ) < 1. By symmetry of the construction, 
it suffices to check herdisc(J-"i). We inductively prove a slightly stronger statement: for every 
W C V, there is a ±1 coloring x of If such that x{F n W) € {0, 1} for every F G F\. The 
base cases with k = 1 or I = 1 are obvious, and in the inductive step, we inductively color 
W' = V n W and W" = V"t~)W,we flip the colors on W' , and we color p with +1. 
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